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MULTIPLEX GENOTYPING USING SOLID PHASE CAPTURABLE 
DIDEO^tTOCEEOTIDES AND MA SS SPECTROMETRY 
5 ~~ 

This application is a continuation-in-part and claims 
priority of U.S. Serial No. 10/194, 882, filed July 12, 
2002, the contents of which are hereby incorporated by 
reference into this application. 

10 

Background Of The Invention 

Throughout this application, various publications are 
referenced in parentheses. Citations for these 

15 references may be found at the end of the specification 

immediately preceding the claims. The disclosures of 
these publications in their entireties are hereby 
incorporated by reference into this application to more 
fully describe the state of the art to which this 

20 invention pertains. 

Single nucleotide polymorphisms (SNPs), the most common 
genetic variations in the human genome, are important 
markers for identifying disease genes and for 

25 pharmacogenetic studies (1, 2) . SNPs appear in the human 

genome with an average density of once every 1000-base 
pairs (3) . To perform large-scale SNP genotyping, a 
rapid, precise and cost-effective method is required. 
Matrix-assisted laser desorpt ion/ ionization time -of - 

30 flight mass spectrometry (MALDI-TOF MS) (4) allows rapid 

and accurate sample measurements (5-7) and has been used 
in a variety of SNP detection methods including 
hybridization (8-10), invasive cleavage (11, 12) and 
single base extension (SBE) (5, 13-17) . SBE is widely 

35 used for multiplex SNP analysis. In this method, primers 
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designed to anneal immediately adjacent to a polymorphic 
site are extended by a single dideoxynucleotide that is 
complementary to the nucleotide at the variable site. By 
measuring the mass of the resulting extension product, a 
5 particular SNP can be identified. Current SBE methods to 

perform multiplex SNP analysis using MS require 
unambiguous simultaneous detection of a library of 
primers and their extension products. However, 
limitations in resolution and sensitivity of MALDI-TOF MS 
10 for longer DNA molecules make it difficult to 

simultaneously measure DNA fragments over a large mass 
range (6) . The requirement to measure both primers and 
their extension products in this range limits the scope 
of multiplexing. 

15 

A high fidelity DNA sequencing method has been developed 
which uses solid phase capturable biotinylated 
dideoxynucleotides (biotin-ddNTPs ) by detection with 
fluorescence (18) or mass spectrometry (19), eliminating 
0 false terminations and excess primers. Combinatorial 

fluorescence energy transfer tags and biotin-ddNTPs have 
also been used to detect SNPs (20) . 

False stops or terminations occur when a deoxynucleotide 
5 rather than a dideoxynucleotide terminates a se+quencing 

fragment. It has been shown that false stops and primers 
which have dimerized can produce peaks in the mass 
spectra that can mask the actual results preventing 
accurate base identification (21) . 

0 

The present application discloses an approach using solid 
phase capturable biotin-ddNTPs in SBE for multiplex 
genotyping by MALDI-TOF MS. In this method primers that 
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have different molecular weights and that are specific to 
the polymorphic sites in the DNA template are extended 
with biotin-ddNTPs by DNA polymerase to generate 3'- 
biotinylated DNA extension products. The 3 f -biotinylated 

5 DNAs are then captured by streptavidin-coated magnetic 

beads , while the unextended primers and other components 
in the reaction are washed away. The pure DNA extension 
products are subsequently released from the magnetic 
beads, for example by denaturing the biotin-streptavidin 
10 interaction with formamide, and analyzed with MALDI-TOF 

MS. The nucleotide at the polymorphic site is identified 
by analyzing the mass difference between the primer 
extension product and an internal mass standard added to 
the purified DNA products. Since the primer extension 

5 products are isolated prior to MS analysis, the resulting 

mass spectrum is free of non-extended primer peaks and 
their associated dimers, which increases the accuracy and 
scope of multiplexing in SNP analysis. The solid phase 
purification system also facilitates desalting of the 

0 captured oligonucleotides. Desalting is critical in 

sample preparation for MALDI-TOF MS measurement since 
alkaline and alkaline earth salts can form adducts with 
DNA fragments that interfere with accurate peak detection 
(21) . 
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Summary Of The Invention 

This invention is directed to a method for determining 
the identity of a nucleotide present at a predetermined 
5 site in a DNA whose sequence immediately 3' of such 

predetermined site is known which comprises: 

(a) treating the DNA with an oligonucleotide primer 
whose sequence is complementary to such known 
sequence so that the oligonucleotide primer 

10 hybridizes to the DNA and forms a complex in 

which the 3' end of the oligonucleotide primer 
is located immediately adjacent to the 
predetermined site in the DNA; 

(b) simultaneously contacting the complex from step 
15 (a) with four different labeled 

dideoxynucleotides, in the presence of a 
polymerase under conditions permitting a 
labeled dideoxynucleotide to be added to the 3' 
end of the primer so as to generate a labeled 

20 ■ single base extended primer, wherein each of 

the four different labeled dideoxynucleotides 
(i) is complementary to one of the four 
nucleotides present in the DNA and (ii) has a 
molecular weight which can be distinguished 

25 from the molecular weight of the other three 

labeled dideoxynucleotides using mass 

spectrometry; and 

(c) determining the difference in molecular weight 
between the labeled single base extended primer 

30 and the oligonucleotide primer so as to 

identify the dideoxynucleotide incorporated 
into the single base extended primer and 
thereby determine the identity of the 
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nucleotide present* at the predetermined site in 
the DNA. 

In one embodiment, the method further comprises after 
5 step (b) the steps of: 

(i) contacting the labeled single base extended 
primer with a surface coated with a compound 
that specifically interacts with a chemical 
moiety attached to the dideoxynucleotide by a 

0 linker so as to thereby capture the' extended 

primer on the surface; and 

(ii) treating the labeled single base extended 
primer so as to release it from the surface. 



15 in one embodiment, the method further comprises after 

step (i) the step of treating the surface to remove 

primers that have not been extended by a labeled 
dideoxynucleotide . 
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Brief Description Of The Figures 

Figure 1: Scheme of single base extension for multiplex 
SNP analysis using biotin-ddNTPs and MALDI-TOF MS. 
5 Primers that anneal immediately next to the polymorphic 

sites in the DNA template are extended by DNA polymerase 
of a biotin-ddNTP in a sequence-specific manner. After 
solid phase capture and isolation of the 3' -biotinylated 
DNA extension fragments, MALDI-TOF MS was used to analyze 
10 these DNA products to yield a mass spectrum. From the 

relative mass of each extended primer, compared to the 
mass of an internal standard, the nucleotide at the 
polymorphic site is identified. 

15 Figure 2. Multiplex SNP genotyping mass spectra generated 

using biotin-ddNTPs. Inset is a magnified view of 
heterozygote peaks. Masses of the extension product in 
reference to the internal mass standard were listed on 
each single base extension peak. The mass values in 

20 parenthesis indicate the mass difference between the 

extension products and the corresponding primers. (A) 
Detection of six nucleotide variations from synthetic DNA 
templates mimicking mutations in the p53 gene. Four 
homozygous (T, G, C and C) and one heterozygous (C/A) 

25 genotypes were detected. (B) Detection of two 

heterozygotes (A/G and C/G) in the human HFE gene. 

Figure 3: Structure of four mass tagged biotinylated 
ddNTPs. Any of the four ddNTPs (ddATP, ddCTP, ddGTP, 
30 ddTTP) can be used with any of the illustrated linkers. 

Figure 4: Synthesis scheme for mass tag linkers. For 
illustrative purposes, the linkers are labeled to 
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correspond to the specific ddNTP with which they are 
shown coupled in Figures 3, 5, 7, 8 and 9. However, any 
of the three linkers can be used with any ddNTP. 
(i) (CF3CO2O; (ii) Disuccinimidylcarbonate/ 

5 diisopropylethylamine; (iii) Propargyl amine. 

Figure 5: The synthesis of ddATP-Linker-II-ll-Biotin . 

(i) Linker II, tetrakis (triphenylphosphine) palladium(O) ; 

(ii) POCI3, Bn 4 N + pyrophosphate; (iii) NH 4 OH; (iv) Sulfo- 
10 NHS-LC-Biotin . 

Figure 6: DNA products are purified by a streptavidin 
coated porous silica surface. Only the biotinylated 
fragments are captured. These fragments are then cleaved 
15 by light irradiation (hv) to release the captured 

fragments, leaving the biotin moiety still bound to the 
streptavidin . 

Figure 7: Mechanism for the cleavage of photocleavable 
20 linkers. 

Figure 8: The structures of ddNTPs linked to 
photocleavable (PC) biotin. Any of the four ddNTPs 
(ddATP, ddCTP, ddGTP, ddTTP) can be used with any of the 
25 shown linkers. 

Figure 9: The synthesis of ddATP-Linker-II-PC-Biotin . PC 
= photocleavable. 

30 Figure 10: Schematic for capturing a DNA fragment 

terminated with a dideoxynucleoside monophosphate on a 
surface. The dideoxynucleoside monophosphate (ddNMP) 
which is on the 3' end of the DNA fragment is attached 
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via a linker to a chemical moiety "X" which interacts 

with a compound "Y" on the surface to capture the DNA 

fragment terminated with the ddNMP. The DNA fragment can 
be freed from the surface either by disrupting the 

5 interaction between chemical moiety X and compound Y 

(lower scheme) or by cleaving the linker (upper scheme) . 

Figures 11A-11C: Schematic of a high throughput channel 
based purification system. Sample solutions can be 
10 pushed back and forth between the two plates through 

glass capillaries and the streptavidin coated channels in 
the chip. The whole chip can be irradiated to cleave the 
samples after immobilization. 

15 Figure 12: The synthesis of streptavidin coated porous 

surface . 



Figures 13A-13C: Simultaneous detection of nucleotide 
variations in 30 codons of the p53 tumor suppressor gene 

20 by MALDI-TOF MS using solid phase capturable biotinylated 

dideoxynucleotide . Each peak represents a different 
polymorphism labeled with its nucleotide identity and 
absolute mass value. The value in parentheses, denoting 
the mass difference between each DNA extension product 

25 and its corresponding primer, is used to determine the 

nucleotide identity. (A) A mass spectrum from a Wilms 1 
tumor sample showing 30 wild type p53 sequences. (B) A 
mass spectrum from a head and neck tumor (primary tumor 
biopsy) containing a heterozygous genotype G/T (4684/4734 

30 Da) (boxed) in codon 157, corresponding to the wild type 

and mutant alleles, respectively. (C) A mass spectrum 
from a colorectal tumor cell line (HT-29) containing a 
homozygous G to A mutation (boxed) in codon 273 of the 
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p53 gene. The colorectal- tumor cell line (SW-480) 
contained the identical G to A mutation in codon 273. 

Figures 14A-14B: (A) A mass spectrum from a head and neck 
5 tumor sample showing 30 wild type sequences of the p53 

gene. (B) A mass spectrum from a head and neck tumor cell 
line (SCC-4) containing a homozygous C (5881 Da) to T 
(5970 Da) mutation (boxed) in codon 151 of the p53 gene. 
Both spectra were produced using the primers shown in 
10 Table 3 with primer 16 replaced by primer 5 1 - 

TGTGGGTTGATTCCACA-3 9 for detecting the variation in codon 
151 (C/TCC) . 
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Detailed Description Of The Invention 



The following definitions are presented as an aid in 
understanding this invention. 

The standard abbreviations for nucleotide bases are used 
as follows: adenine (A), cytosine (C) , guanine (G) , 
thymine (T) , and uracil (U) . 

A nucleotide analogue refers to a chemical compound that 
is structurally and functionally similar to the 
nucleotide*, i.e. the nucleotide analogue can be 
recognized by polymerase as a substrate. That is, for 
example, a nucleotide analogue comprising adenine or an 
analogue of adenine should form hydrogen bonds with, 
thymine, a nucleotide analogue comprising C or an 
analogue of C should form hydrogen bonds with G, a 
nucleotide analogue comprising G or an analogue of G 
should form hydrogen bonds with C, and a nucleotide 
analogue comprising T or an analogue of T should form 
hydrogen bonds with A, in a double helix format. 

This invention is directed to a method for determining 
the identity of a nucleotide present at a predetermined 
25 site in a DNA whose sequence immediately 3' of such 

predetermined site is known which comprises: 

(a) treating the DNA with an oligonucleotide primer 
whose sequence is complementary to such known 
sequence so that the oligonucleotide primer 
30 hybridizes to the DNA and forms a complex in 

which the 3' end of the oligonucleotide primer 
is located immediately adjacent to the 
predetermined site in the DNA; 



10 



15 



20 
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simultaneousiy contacting the complex from step 
(a) with four different labeled 

dideoxynucleotides, in the presence of a 
polymerase under conditions permitting a 
labeled dideoxynucleotide to be added to the 3' 
end of the primer so as to generate a labeled 
single base extended primer, wherein each of 
the four different labeled dideoxynucleotides 
(i) is complementary to one of the four 
nucleotides present in the DNA and (ii) has a 
molecular weight which can be distinguished 
from the molecular weight of the other three 
labeled dideoxynucleotides using mass 

spectrometry; and 

determining the difference in molecular weight 
between the labeled single base extended primer 
and the oligonucleotide primer so as to 
identify the dideoxynucleotide incorporated 
into the single base extended primer and 
thereby determine the identity of the 
nucleotide present at the predetermined site in 
the DNA . 

In one embodiment, each of the four labeled 
25 dideoxynucleotides comprises a chemical moiety attached 

to the dideoxynucleotide by a different linker which has 
a molecular weight different from that of each other 
linker . 

30 In one embodiment, the method further comprises after 

step (b) the steps of: 

(i) contacting the labeled single base extended 
primer with a surface coated with a compound 
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that specifically interacts with a chemical 
moiety attached to the dideoxynucleotide by a 
linker so as to thereby capture the extended 
primer on the surface; and 

5 (ii) treating the labeled single base extended 

primer so as to release it from the surface. 

In a further embodiment, the method comprises after step 
(i) the step of treating the surface to remove primers 
10 that have not been extended by a labeled 

dideoxynucleotide and any non-captured component. 

In one embodiment of the method step (c) comprises 
determining the difference in mass between the labeled 
15 single base extended primer and an internal mass 

calibration standard added to the extended primer. In 
one embodiment, the internal mass standard is 5'- 
TTTTTCTTTTTCT-3 ' (SEQ ID NO: 5) (MW = 3855 Da) . 

20 In one embodiment, the chemical moiety is attached via a 

different linker to different dideoxynucleotides . In one 
embodiment, the different linkers increase mass 
separation between different labeled single base extended 
primers . and thereby increase mass spectrometry 

25 resolution. 



In one embodiment, the dideoxynucleotide is selected from 

the group consisting of 2' , 3' -dideoxyadenosine 5'- 

triphosphate (ddATP) , 2' , 3' -dideoxyguanosine 5'- 

30 triphosphate (ddGTP) , 2 ' , 3' -dideoxycytidine 5' - 

triphosphate (ddCTP) , and 2 ' , 3' -dideoxythymidine 5' - 
triphosphate (ddTTP) . 
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In different embodiments of the methods described herein, 
the interaction between the chemical moiety attached to 
the dideoxynucleotide by the linker and the compound on 
the surface comprises a biotin-streptavidin interaction, 
5 a phenylboronic acid-salicylhydroxamic acid interaction, 

or an antigen-antibody interaction. 

In one. embodiment, the step of releasing the labeled 
single base extended primer from the surface comprises 

0 disrupting the interaction between the chemical moiety 

attached by the linker to the dideoxynucleotide and the 
compound on the surface. In different embodiments, the 
interaction is disrupted by a means selected from the 
group consisting of one or more of a physical means, a 

5 chemical means, a physical chemical means, heat, and 

light. In one embodiment, the interaction is disrupted 
by light. In one embodiment, the interaction is 

disrupted by ultraviolet light. In different 

embodiments/ the interaction is disrupted by ammonium 

0 hydroxide, formamide, or a change in. pH (-log H + 

concentration) . 



In different embodiments, the linker can comprise a chain 
structure, or a structure comprising one or more rings, 

5 or a structure comprising a chain and one or more rings. 

In different embodiments, the dideoxynucleotide comprises 
a cytosine or a thymine with a 5-position, or an adenine 
or a guanine with a 7-position, and the linker is 
attached to the dideoxynucleotide at the 5-position of 

0 cytosine or thymine or at the 7-position of adenine or 

guanine . 
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In different embodiments , the step of releasing the 
labeled single base extended primer from the surface 
comprises cleaving the linker between the chemical moiety 
and the dideoxynucleotide . In different embodiments, the 
5 linker is cleaved by a means selected from the group 

consisting of one or more of a physical means, a chemical 
means, a physical chemical means, heat, and light. In 
one embodiment, the linker is cleaved by light. In one 
embodiment, the linker is cleaved by ultraviolet light. 
10 In different embodiments, the linker is cleaved by 

ammonium hydroxide, formamide, or a change in pH (-log H + 
concentration) : 

In one embodiment, the linker comprises a derivative of 
15 4-aminomethyl benzoic acid. In one embodiment, the 

linker comprises a 2-nitrobenzyl group or a derivative of 
a 2-nitrobenzyl group. In one embodiment, the linker 
comprises one or more fluorine atoms. 
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In one embodiment, the linker is selected from the group 
consisting of: 




CH 2 NHC(0)CF 3 




CH 2 NHC(0)CF 3 

5 

and 




CH 2 NHC(0)CF 3 

10 In one embodiment, a plurality of different linkers is 

used to increase mass separation between different 
labeled single base extended primers and thereby increase 
mass spectrometry resolution. 



15 



In one embodiment, the chemical moiety comprises biotin, 
the labeled dideoxynucleotide is a biotinylated 
dideoxynucleotide, the labeled single base extended 
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primer is a biotinylated single base extended primer, and 
the surface is a streptavidin-coated solid surface. In 
one embodiment, the biotinylated dideoxynucleotide is 
selected from the group consisting of ddATP-ll-biotin, 
ddCTP-ll-biotin, ddGTP-ll-biotin, and ddTTP-16-biotin . 
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In one embodiment, the biotinylated dideoxynucleotide is 
selected from the group consisting of: 




5 



wherein ddNTPl, ddNTP2, ddNTP3, and ddNTP4 represent 
four different dideoxynucleot ides , or their 
analogues . 




5 



wherein ddNTPl, ddNTP2, ddNTP3, and ddNTP4 represent 
four different dideoxynucleotides or their 
analogues . 
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In one embodiment, the biotinylated dideoxynucleotide is 
selected from the group consisting of: 



ddCTP 




o 



ddTTP 
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In one embodiment, the streptavidin-coated solid surface 
is a streptavidin-coated magnetic bead or a streptavidin- 
coated silica glass. 

5 In one embodiment of the method, steps (a) and (b) are 

performed in a single container or in a plurality of 
connected containers. 

The invention provides methods for determining the 
identity of nucleotides present at a plurality of 
predetermined sites, which comprises carrying out any of 
the methods disclosed herein using a plurality of 
different primers each having a molecular weight 
different from that of each other primer, wherein a 
different primer hybridizes adjacent to a different 
predetermined site. In one embodiment, different linkers 
each having a molecular weight different from that of 
each other linker are attached to the different 
dideoxynucleotides to increase mass separation between 
different labeled single base extended primers and 
thereby increase mass spectrometry resolution. 

In one embodiment, the mass spectrometry is matrix- 
assisted laser desorption/ionization time-of-f light mass 
25 spectrometry. 

Linkers are provided for attaching a chemical moiety to a 
dideoxynucleotide, wherein the linker comprises a 
derivative of 4-aminomethyl benzoic acid. 

30 

In one embodiment, the dideoxynucleotide is selected from 
the group consisting of 2 ' , 3' -dideoxyadencsine 5'- 



15 



WO 2004/007773 PCT/US2003/021818 

-22- 

triphosphate (ddATP) , 2' , 3' -dideoxyguanosine 5'- 

triphosphate (ddGTP) , 2' ,2' -dideoxycytidine 5' - 

triphosphate (ddCTP) , and 2' , 3' -dideoxy thymidine 5'- 
triphosphate (ddTTP) . 

5 

In one embodiment, the linker comprises one or more 
fluorine atoms. 

In one embodiment, the linker is selected from the group 
10 consisting of: 




CH 2 NHC(0)CF 3 




15 
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In different embodiments, the linker can comprise a chain 
structure, or a structure comprising one or more rings, 
or a structure comprising a chain and one or more rings. 

5 In different embodiments, the linker is cleavable by a 

means selected from the group consisting of one or more 
of a physical means, a chemical means, a physical 
chemical means, heat, and light. In one embodiment, the 
linker is cleavable by ultraviolet light. In different 
10 embodiments, the linker is cleavable by ammonium 

hydroxide, formamide, or a change in pH (-log H + 
concentration) . 

In different embodiments of the linker, the chemical 
15 moiety comprises' biotin, streptavidin or related 

' analogues that have affinity with biotin, phenylboronic 
acid, salicylhydroxamic acid, an antibody, or an antigen. 



In 'different embodiments, the dideoxynucleotide comprises 
20 a cytosine or a thymine with a 5-position, or an adenine 

or a guanine with a 7-position, and the linker is 
attached to the 5-position of cytosine or thymine or to 
the 7-position of adenine or guanine. 

The invention provides for the use of any of the linkers 
described herein in single nucleotide polymorphism 
detection using mass spectrometry, wherein the linker 
increases mass separation between different 

dideoxynucleotides and increases mass spectrometry 
resolution . 



25 



30 



Labeled dideoxynucleotides are provided which comprise a 
chemical moiety attached via a linker to a 5-position of 
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cytosine or thymine or to a 7-position of adenine or 
guanine - 

In one embodiment, the dideoxynucleotide is selected from 
the group consisting of 2' , 3' -dideoxyadenosine 5' - 
triphosphate (ddATP) , 2' , 3' -dideoxyguanosine 5'- 

triphosphate (ddGTP) , 2' ,3' -dideoxycytidine 5' - 

triphosphate (ddCTP) , and * 2 ' , 3' -dideoxythymidine 5'- 
triphosphate (ddTTP) . 



10 



In different embodiments, the linker can comprise a chain 
structure, or a structure comprising one or more rings, 
or a structure comprising a chain and one or more rings. 
In different .embodiments, the linker is cleavable by a 

15 means selected from the group consisting of one or more 

of a physical means, a chemical means, a physical 
chemical means, heat, and light. In one embodiment, the 
linker is cleavable by ultraviolet light. In different 
embodiments, the linker is cleavable by ammonium 

20 hydroxide, formamide, or a change in pH -log [H + 

concentration] . 



In different embodiments of the labeled 

dideoxynucleotide, the chemical moiety comprises biotin, 
25 streptavidin, phenylboronic acid, salicylhydroxamic acid, 

an antibody, or an antigen. 



30 
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In one embodiment, the labeled dideoxynucleotide is 
selected from the group consisting of: 




5 



wherein ddNTPl, ddNTP2, ddNTP3, and ddNTP4 represent 
four different dideoxynucleotides, or their 
analogues . 
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In one embodiment, the labeled dideoxynucleotide is 
selected from the group consisting of: 




wherein ddNTPl, ddNTP2, ddNTP3, and ddNTP4 represent four 
different dideoxynucleotides, or their analogues. 
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In one embodiment, the labeled dideoxynucleotide has a 
molecular weight of 844, 977, 1,017, or 1,051. In one 
embodiment, the labeled dideoxynucleotide has a molecular 
weight of 1,049, 1,182, 1,222, or 1,257. Other molecular 
5 weights with sufficient mass differences to allow 

resolution in mass spectrometry can also be used. 

In one embodiment the mass spectrometry is matrix- 
assisted laser desorption/ionization time-of-f light mass 
10 spectrometry. 

A system is provided for separating a chemical moiety 
from other components in a sample in solution, which 
comprises : 

15 (a) a channel coated with a compound that 

specifically interacts with the chemical moiety 
at the 3' end of the DNA fragment, wherein the 
channel comprises a plurality of ends; 

(b) a plurality of wells each suitable for holding 
20 the sample; 

(c) a connection between each end of the channel 
and a well; and 

(d) a means for moving the sample through the 
channel between wells. 

25 

In one embodiment of the system, the interaction between 
the chemical moiety and the compound coating the surface 
is a biotin-streptavidin interaction, a phenylboronic 
acid-salicylhydroxamic acid interaction, or an antigen- 
30 antibody interaction. 



In one embodiment, the chemical moiety is a biotinylated 
moiety and the channel is a streptavidin-coated silica 



WO 2004/007773 PCT/US2003/021818 

-30- 

glass channel. In one embodiment, the biotinylated moiety 
is a biotinylated DNA fragment. 

In one embodiment, the chemical moiety can be freed from 
5 the surface by disrupting the interaction between the 

chemical moiety and the compound coating the surface. In 
different embodiments, the interaction can be disrupted 
by a means selected from the group consisting of one or 
more of a physical means, a chemical means, a physical 
10 chemical means, heat, and light. In different 

embodiments, the interaction can be disrupted by ammonium 
hydroxide, formamide, or a change in pH -log [H + 
concentration] . 

In one embodiment, the chemical moiety is attached via a 
linker to another chemical compound. In one embodiment, 
the other chemical compound is a DNA fragment. In one 
embodiment, the linker is cleavable by a means selected 
from the group consisting of one or more of a physical 
means, a chemical means, a physical chemical means, heat, 
and light. In one embodiment, the channel is transparent 
to ultraviolet light and the linker is cleavable by 
ultraviolet light. Cleaving the linker frees the DNA 
fragment or other chemical compound from the chemical 
moiety which remains captured on the surface. 

Multi-channel systems are provided which comprise a 
plurality of any of the single channel systems disclosed 
herein. In one embodiment, the channels are in a chip. 
30 In one embodiment, the multi-channel system comprises 96 

channels in a chip. Chips can also be used with fewer or 
greater than 96 channels. 



15 



20 



25 
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The invention provides for the use of any of the 

separation systems described herein for single nucleotide 
polymorphism detection. 

5 This invention will be better understood from the 

Experimental Details which follow. However, one skilled 
in the art will readily appreciate that the specific 
methods and- results discussed are merely illustrative of 
the invention as described more fully in the claims which 
10 follow thereafter. 
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Experimental Set I 
A. Materials and Methods 

PCR amplification. DNA templates containing the 

polymorphic sites for the human hereditary 
hemochromatosis gene HFE were amplified from genomic DNA 
in a total volume of 10 ^1, that contains 20 ng of 
genomic DNA, 500 pmol each of forward (C282Y; 5'- 
CTACCCCCAGAACATCACC-3' (SEQ ID NO: 1), H63D; 5'- 
GCACTACCTCTTCATGGGTGCC-3 1 (SEQ ID NO: 2)) and reverse 
(C282Y; 5' -CATCAGTCACATACCCCA-3 1 (SEQ ID NO: 3), H63D; 
5'-CAGTGAACATGTGATCCCACCC-3» (SEQ ID NO: 4)) primers, 
25 ^iM dNTPs (Amersham Biosciences, Piscataway, NJ) , 1 U 
Tag polymerase (Life Technologies, Rockville, MD) , and lx 
PCR buffer (50 mM KCl, 1.5 mM MgCl 2 , 10 mM Tris-HCl) . PCR 
amplification reactions were started at 94 °C for 4 min, 
followed by 45 cycles of 94 °C for 30 s, 59 °C for 30 s 
and 72 °C for 10 s, and finished with an additional 
extension step of 72 °C for 6 min. Excess primers and 
dNTPs were degraded by adding 2 U shrimp alkaline 
phosphatase (Roche Diagnostics, Indianapolis, IN) and E. 
Coli exonuclease I (Boehringer Mannheim, Indianapolis, 
IN) in lx shrimp alkaline phosphatase buffer. The 
reaction mixture was incubated at 37 °C for 45 min 
followed by enzyme inactivation at 95 °C for 15 min. 

Single base extension using biotin-ddNTPs . The synthetic 
DNA templates containing six nucleotide variations in p53 
gene and the five primers for detecting these variations 
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are shown in Table 1. These oligonucleotides and an 
internal mass standard (5' -TTTTTCTTTTTCT-3' (SEQ ID NO: 
5), MW = 3855 Da) for MALDI-TOF MS measurement were made 
using an Expedite nucleic acid synthesizer (Applied 
i Biosystems, Foster City, CA) . SBE reactions contained 20 

pmol of primer, 10 pmol of biotin-1 1-ddATP, 20 pmol of 
biotin-ll-ddGTP, 40 pmol of biotin-ll-ddCTP (New England 
Nuclear Life Science, Boston^ MA) , 80 pmol of biotin-16- 
ddUTP (Enzo Diagnostics, Inc., Farmingdale, NY), 2 pi 
) Thermo Sequenase reaction buffer, 1 U Thermo Sequenase in 

its diluted buffer (Amersham Biosciences) and 20 pmol of 
either synthetic template or 10 fil PCR product in a total 
reaction volume of 20 ^1 . For SBE using synthetic 
template 1, 10 pmol of both wild type and mutated 
5 . templates were combined with 2 0 pmol of primers 1 and 3 
or 2 0 pmol of primers 2 and 4. The SBE reaction of 
primer 5 was performed with template 2 in a separate 
tube. PCR products from the HFE gene were mixed with 
20 pmol of the corresponding primers 5'- 

0 GGGGAAGAGCAGAGATATACGT-3' (SEQ ID NO: 6) (C282Y) and 5'- 

GGGGCTCCACACGGCGACTCTC-AT-3 1 (SEQ ID NO: 7) (H63D) in SBE 
to detect the two heterozygous genotypes. All extension 
reactions were thermalcycled for 35 cycles at 94 °C for 
10 s and 49 °C for 30 s. 

5 

Solid phase purification. 2 0 )il of the streptavidin- 
coated magnetic beads (Seradyn, Ramsey, MN) were washed 
with modified binding and washing (B/W) buffer (0.5 mM 
Tris-HCl buffer, 2 M NH 4 C1, 1 mM EDTA, pH 7.0) and 
0 resuspended in 20 jal modified B/W buffer. Extension 

reaction mixtures of primers 1-4 with template 1 and 
primer 5 with template 2 were mixed in a 2:1 ratio, while 
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extension reaction mixtures - from the PCR products of HFE 
gene were mixed in equal amounts. 20 \il of each mixed 
extension product was added to the suspended beads and 
incubated for 1 hour. After capture, the beads were 
washed twice with modified B/W buffer, twice with 0.2 M 
triethyl ammonium acetate (TEAA) buffer and twice with 
deionized water. The primer extension products were 
released from the magnetic beads by treatment with 8 pi 
98 ^% formamide solution containing 2 % 0.2 M TEAA buffer 
at 94 °C for 5 min. The released primer extension 
products were precipitated with 100 % ethanol at 4 °C for 
30 min, and centrifuged at 4 °C and 14000 RPM for 35 min. 

MALDI-TOF MS analysis. The purified primer extension 
products were dried and re-suspended in 1 jxl deionized 
water and 2 ^il matrix solution. The matrix solution was 
made by dissolving . 35 mg of 3-hydroxypicolinic acid (3- 
HPA; Aldrich, Milwaukee, WI) and 6 mg of ammonium citrate 
(Aldrich) in 0 . 8 ml of 50 % acetonitrile . 10 pmol 

internal mass standard in 1 ^il of 50 % acetonitrile was 
then added to the sample. 0.5 ^l of this mixture 
containing the primer extension products and internal 
standard was spotted on a stainless steel sample plate, 
air-dried and analyzed using an Applied Biosystems 
Voyager DE Pro MALDI-TOF mass spectrometer. All 
measurements were taken in linear positive ion mode with 
a 25 kV accelerating voltage, a 94 % grid voltage and a 
350 ns delay time. The obtained spectra were processed 
using the Voyager data analysis package. 
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B. Detection of Single Nucleotide Polymorphism Using 
Biotinylated Dideoxynucleotides and Mass Spectrometry 

5 Solid phase capturable biotinylated dideoxynucleotides 

(biotin-ddNTPs) were used in single base extension for 
multiplex genotyping by mass spectrometry (MS) . In this 
method, oligonucleotide primers that have different 
molecular weights and that are specific to the 

0 polymorphic sites in the DNA template are extended with 

biotin-ddNTPs by DNA polymerase to generate 3 1 - 
biotinylated DNA extension products (Figure 1) . These 
products are then captured by streptavidin-coated solid 
phase magnetic beads, while the unextended primers and 

.5 other components in the reaction are washed away. The 

pure extension DNA products are subsequently released 
from the solid phase and analyzed with matrix-assisted 
laser desorption/ionization time-of -flight MS. The mass 
of the extension DNA products is determined using a 

0 stable oligonucleotide as a common internal mass 

standard. Since only the pure extension DNA products are 
introduced to MS for analysis, the resulting mass 
spectrum is free of non-extended primer peaks and their 
associated dimers, which increases the accuracy and scope 

:5 of multiplexing in single nucleotide polymorphism (SNP) 

analysis. The solid phase purification approach also 
facilitates desalting of the captured oligonucleotides, 
which is essential for accurate mass measurement by MS. 

\ 0 Four biotin-ddNTPs with distinct molecular weights were 

selected to generate extension products that have a two- 
. fold increase in mass difference compared to that with 
conventional ddNTPs. This increase in mass difference 
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provides improved resolution and accuracy in detecting 
heterozygotes in the mass spectrum. 

The "lock and key'' functionality of biotin and 
streptavidin is often utilized in biological sample 
preparation as a way to remove undesired impurities (23) . 
In different embodiments of the methods described herein, 
affinity systems other than biotin-streptavidin can be 
used. Such affinity systems include but are not limited 
to phenylboronic acid-salicylhydroxamic acid (31) and 
antigen-antibody systems. 



.The multiplex genotyping approach was validated by 
detecting six nucleotide variations from synthetic DNA 

15 templates that mimic mutations in exons 7 and 8 of the 

p53 gene. Sequences of the templates and the 

corresponding primers are shown in Table 1 along with the 
masses of the primers and their extension products. The 
mass increase of the resulting single base extension 

20 products in comparison with the primers is 665 Da for 

addition of biotin-ddCTP, 688 Da for addition of biotin- 
ddATP, 704 Da for addition of biotin-ddGTP and 754 Da for 
addition of biotin-ddUTP . The mass data in Table 1 
indicate that the smallest mass difference among any 

25 possible extensions of a primer is 16 Da (between biotin- 

ddATP and biotin-ddGTP) . This is a substantial increase 
over the smallest mass difference between extension 
products using standard ddNTPs (9 Da between ddATP and 
ddTTP) . This mass increase yields improved resolution of 

30 the peaks in the mass spectrum. Increased mass 

difference in ddNTPs fosters accurate detection of 
heterozygous genotypes (15) , since an A/T heterozygote 
with a mass difference of 9 Da using conventional ddNTPs 
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can not be well resolved in the MALDI-TOF mass spectra. 
The five primers for each polymorphic site were designed 
to produce extension products without overlapping masses. 
Primers extended by biot in-ddNTPs were purified and 
5 analyzed by MALDI-TOF MS according to the scheme in 

Figure 1. Extension products of all five primers were 
well-resolved in the mass spectrum free from any 
unextended primers (Figure 2A) , allowing each nucleotide 
variation to be unambiguously identified. Unextended 

10 primers occupy the mass range in the mass spectrum 

decreasing the scope of multiplexing, and excess primers 
can dimerize to form false peaks in the mass spectrum 
(21) . The excess primers and their associated dimers 
also compete for the ion current, reducing the detection 

15 sensitivity of MS for the desired DNA fragments. . These 

complications were completely removed by carrying out SBE 
using biotin-ddNTPs and solid phase capture. Extension 
products for all four biotin-ddNTPs were clearly detected 
with well resolved mass values. The relative masses of 

20 the primer extension products in comparison to the 

internal mass standard revealed the identity of each 
nucleotide at the polymorphic site. In the case of 
heterozygous genotypes, two peaks, one corresponding to 
each allele (C/A) , are clearly distinguishable in the 

25 mass spectrum shown in Figure 2A. 
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Table 1. oligonucleotide primers and synthetic DNA 
templates for detecting mutations in the p53 gene. 
(Top) The sequences and the calculated masses of 
primers and the four possible single base extension 
products relative to the internal mass standard are 
listed. The bold numbers refer to the nucleotide 
variations detected in the p53 gene. (Bottom) The six 
nucleotide variations in template 1 and 2 are shown 
in bold letters. Template 1 contains a heterozygous 
genotype (G/T) . Primers 1-5 = SEQ ID NOs : 8-12, 
respectively. 



Primers 


Primer sequences 


Masses 
(Da) 


Masses of single base extension 
products (Da) 


Biotin- 
ddCTP 

A665 


Biotin- 
ddATP 

A688 


Biotin- 
ddGTP 

A704 


Biotin- 
ddUTP 

A754 


1 


5' -AGAGGATCCAACCGAGAC-3' 


1656 


2321 


2344 


2360 


2410 


2 


5' - 


3350 


4015 


4038 


4054 


4103 


3 


TGGTGGTAGGTGATGTTGATGTA- 
3' 


2833 


3498 


3521 


3538 


3587 


4 


5'- 


2134 


2799 


2822 


2838 


2480 


5 


CACATTGTCAAGGACGTACCCG- 3 ' 


2507 


3172 


3195 


3211 


3261 




5' -TACCCGCCGTACTTGGCCTC- 
3' 














5' -TCCACGCACAAACACGGACAG- 














3' 













Templates 


Template sequences 


1 
2 


5' - 

TACCCG/TGAGGCCAAGTACGGCGGGTACGTCCTTGACAATGTGTACATCAACATCACCTACCACCATGT 
CAGTCTCGGTTGGATCCTCTATTGTGTCCGGG- 3 1 (SEQ ID NO: 13) 

5'- 

GAAGGAGACACGCGGCCAGAGAGGGTCCTGTCCGTGTTTGTGCGTGGAGTTTCGACAAGGCAGGGTCAT 
CTAATGGTGATGAGTCCTATCCTTTTCTCTTCGTTCTCCGT-3 * (SEQ ID NO: 14) 
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One advantage of MALDI-TOF MS in comparison to other 
detection techniques is its ability to simultaneously 
measure masses of DNA fragments over a certain range. 



5 In order to explore this feature to detect multiple SNPs 

in a single spectrum, if unextended primers are not 
removed, masses of all primers and their extension 
products must have sufficient differences to yield 
adequately resolved peaks in the mass spectrum, Ross et 

10 al- simultaneously detected multiple SNPs by carefully 

tuning the masses of all primers and extension products 
so that they would lie in the range of 4.5 kDa and 
7.6 kDa without overlapping (14). Since the unextended 
primers occupy the mass range in the mass spectrum, by 

15 eliminating them, the approach disclosed herein will 

significantly increase the scope of multiplexing in SNP 
analysis . 

To demonstrate the ability of this method to discriminate 
20 SNPs in genomic DNA, two disease associated SNPs were 

genotyped in the human hereditary hemochromatosis (HHC) 
gene HFE. HHC is a common genetic condition in 

Caucasians with approximately 1/400 Caucasians homozygous 
for the C282Y mutation leading to iron overload and 
25 potentially liver failure, diabetes and depression (22) . 

A subset of individuals who are compound heterozygotes 
for the C282Y and H63D mutations also manifest iron 
overload. Because of the high prevalence of these 
mutations and the ability to prevent disease 
30 manifestations by phlebotomy, accurate methods for 

genotyping these two SNPs will foster genetic screening 
for this condition. Two PCR products were generated from 
human genomic DNA for the C282Y and H63D polymorphic 
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sites of the HFE gene and then used these products for 
SBE with biotin-ddNTPs . After the extension reaction, 
products were purified using solid phase capture 
according to the scheme in Figure 1 and analyzed by 
MALDI-TOF MS. The mass spectrum obtained from this 
experiment is shown in Figure 2B. Extension products of 
each primer were readily identified by their mass 
. relative to the internal mass standard. Heterozygous 
genotypes of A/G and C/G with a mass difference of 16 Da 
i and 39 Da respectively were accurately detected at the 

C282Y and* H63D polymorphic sites. 

These results indicate that the use of solid phase 
capturable biotin-ddNTPs in SBE, coupled with MALDI-TOF 

i MS detection, provides a rapid and accurate method for 

multiplex SNP detection over broad mass ranges and should 
greatly increase the number of SNPs that can be detected 
simultaneously. In multiplex SBE reactions, the 

oligonucleotide primers and their dideoxynucleotide 

) extension products differ by only one base pair, which 

requires analytical techniques with high resolution to 
resolve. In addition, a primer designed to detect one 
polymorphism and an extension product from another 
polymorphic site may have the same size, which can not be 

5 separated by electrophoresis and other conventional 

chromatographic or size exclusion methods. Methods for 
purifying DNA samples using the strong interaction of 
biotin and streptavidin are widely used (23-27) . By 
introducing the biotin moiety at the 3' end of DNA, the 

0 solid phase based affinity purification approach 

described here is a unique and effective method to remove 
the oligonucleotide primers from the dideoxynucleotide 
extension products. 
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To increase the stability of DNA fragments for MALDI-TOF 
MS measurement in multiplex SNP analysis, nucleotide 
analogues (28) and peptide nucleic acid (9) can be used 
in the construction of the oligonucleotide primers. It 
5 has be'en shown that MALDI-TOF MS could detect DNA 

fragments up to 100 bp with sufficient resolution (29).. 
The mass difference between each adjacent DNA fragment is 
approximately 300 Da. Thus, with a mass difference of 
100 Da for each primer in designing a multiplex SNP 
10 analysis project, at least 300 SNPs can be analyzed in a 

single spot of the sample plate by MS. It is a routine 
method now to place 384 spots in each sample plate in MS 
analysis. Thus, each plate can produce over 100,000 
SNPs, which is roughly the entire SNPs in all the coding 
15 regions of the human genome. This level of multiplexing 

should be achievable by mass tagging the primers with 
stable chemical groups in SBE using biotin-ddNTPs . For 
SNP sites of interest, a master database of primers and 
the resulting masses of all four possible extension 
20 products can be constructed. The experimental data from 

MALDI-TOF MS can then be compared with this database to 
precisely identify the library of SNPs automatically. 
This method coupled with future improvements in mass 
spectrometer detector sensitivity (30) will provide a 
25 platform for high-throughput SNP identification unrivaled 

in speed and accuracy. 

C. Design and Synthesis of Biotinylated 

dideoxynucleotides with Mass Tags 

The ability to distinguish various bases in DNA using 
mass spectrometry is dependent on the mass differences of 
the bases in the spectra. For the above work, the 



30 
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smallest difference in mass between any two nucleotides 
is 16 daltons (see Table 1). Fei et al . (15) have shown 
that using dye-labeled ddNTP paired with a regular dNTP 
to space out the mass difference, an increase in the 
5 detection resolution in a single nucleotide extension 

assay can be achieved. To enhance the ability to 
distinguish peaks in the spectra, the current application 
discloses systematic modification of the biotinylated 
dideoxynucleotides by incorporating mass linkers 
0 assembled using 4-aminomethyl benzoic acid derivatives to 

increase the mass separation of the individual bases. The 
mass linkers can be modified by incorporating, one or two 
fluorine atoms to further space out the mass differences 
between the nucleotides. The structures of four 

5 biotinylated ddNTPs are shown in Figure 3. ddCTP-11- 

biotin is commercially available (New England Nuclear, 
Boston) . ddTTP-Linker I-ll-Biotin, ddATP-Linker 11-11- 
Biotin and ddGTP-Linker III-ll-Biotin are synthesized as 
shown, for example, for ddATP-Linker II-ll-Biotin in 
0 Figure 5. In designing these mass tag linker modified 

biotinylated ddNTPs, the linkers are attached to the 5- 
position on the pyrimidine bases (C and T) , and to the 7- 
position on the purines (A and G) for subsequent 
conjugation with biotin. It has been established that 
5 modification of these positions on the bases in the 

nucleotides, even with bulky energy transfer fluorescent 
dyes, still allows efficient incorporation of the 
modified nucleotides into the DNA strand by DNA 
polymerase (32, 33). Thus, the ddNTPs-Linker-ll-biotin 
0 can be incorporated into the growing strand by the 

polymerase in DNA sequencing reactions. 
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Larger mass separations will greatly aid in longer read 
lengths where signal intensity is smaller and resolution 
is lower. The smallest mass difference between two 
individual bases is over three times as great in the mass 
tagged biotinylated ddNTPs compared to normal ddNTPs and 
more than double that achieved by the standard 
biotinylated ddNTPs as shown in Table 2. 



Table 2. Relative mass differences (daltons) 
dideoxynucleotides using ddCTP as a reference. 



Base 


Standard 
ddNTP 


Commercial 
Biotinylated 
ddNTP 


Biotinylated ddNTP 
with mass tag 
linker 


C relative 
to C 


0 


0 


0 (no linker) 


T relative 
to C 


15 


89 (16 
linker) 


125 (Linker I) 


A relative 
to C 


24 


24 


165 (Linker II) 


G relative 
to C 


40 


40 


200 (Linker III) 


Smallest 
relative 
difference 


9 


16 


35 



Three 4-aminomethyl benzoic acid derivatives Linker I, 
15 Linker II and Linker III are designed as mass tags as 

well as linkers for bridging biotin to the corresponding 
dideoxynucleotides. The synthesis of Linker II (Figure 
4) is described here to illustrate the synthetic 
procedure. 3-Fluoro-4-aminomethyl benzoic acid that can 
20 be easily prepared via published procedures (41, 42) is 

first protected with trif luoroacetic anhydride, then 
converted to N-hydroxysuccinimide (NHS) ester with 
disuccinimidylcarbonate in the presence of 

diisopropylethylamine. The resulting NHS ester is 

25 subsequently coupled with commercially available 
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propargylamine to form the desired compound, Linker II. 
Using an analogous procedure, Linker I and Linker III can 
be easily constructed. 

5 Figure 5 describes the scheme required to prepare 

biotinylated ddATP-Linker II-ll-Biotin using well- 
established procedures (34-36). 7-I-ddA is coupled with • 
•linker II in the presence of tetrakis ( triphenylphosphine) 
palladium (0) to produce 7-Linker II-ddA, which is 

10 phosphorylated with P0C1 3 in butylammonium pyrophosphate 

(37). After removing the trif luoroacetyl group with 
ammonium hydroxide, 7-Linker II-ddATP is produced, which 
then couples with sulf o-NHS-LC-Biotin (Pierce, Rockford 
IL) to • yield the desired ddATP-Linker II-ll-Biotin. 

15 Similarly, ddTTP-Linker I-ll-Biotin, and ddGTP-Linker 

III-ll-Biotin can be synthesized. 

D. Design and Synthesis of Mass Tagged ddNTPs Containing 
Photocleavable Biotin 

20 

A schematic of capture and cleavage of the photocleavable 
linker on the streptavidin coated porous surface is shown 
in Figure 6. At the end of the reaction, the reaction 
mixture consists of excess primers, enzymes, salts, false 

25 stops, and the desired DNA fragment. This reaction 

mixture is passed over a streptavidin-coated surface and 
allowed to incubate. The biotinylated fragments are 
captured by the streptavidin surface, while everything 
else in the mixture is washed away. Then the fragments 

30 are released into solution by cleaving the photocleavable 

linker with near ultraviolet (UV) light, while the biotin 
remains attached to the streptavidin that is covalently 
bound to the surface. The pure DNA fragments can then be 
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crystallized in matrix solution and analyzed by mass 
spectrometry. It is advantageous to cleave the biotin 
moiety since it contains sulfur which has several 
relatively abundant isotopes. The rest of the DNA 
5 fragments and linkers contain only carbon, nitrogen, 

hydrogen, oxygen, fluorine and phosphorous, whose 
dominant isotopes are found with a relative abundance of 
99% to 100%. This allows high resolution mass spectra to 
be obtained. The photocleavage mechanism (38, 39) is 

10 shown in Figure 7. Upon irradiation with ultraviolet 

light at 300-350 ran, the light sensitive o-nitroaromatic 
carbonamide functionality on DNA fragment 1 is cleaved, 
producing DNA fragment 2, PC-biotin and carbon dioxide. 
The partial chemical linker remaining on DNA fragment 2 

15 is stable for detection by mass spectrometry. 

Four new biotinylated ddNTPs disclosed here, ddCTP-PC- 
Biotin, ddTTP-Linker I-PC-Biotin, ddATP-Linker II-PC- 
Biotin and ddGTP-Linker III-PC-Biotin are shown in Figure 

20 8. These compounds are synthesized by a similar 

chemistry as shown for the synthesis of ddATP-Linker II- 
11-Biotin in Figure 6. The only difference is that in 
the final coupling step NHS-PC-LC-Biotin (Pierce, 
Rockford IL) is used, as shown in Figure 9. The 

25 photocleavable linkers disclosed here allow the use of 

solid phase capturable terminators and mass spectrometry 
to be turned into a high throughput technique for DNA 
analysis . 



30 
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E. Overview of capturing a DNA fragment terminated with a 
ddNTP on a surface and freeing the ddNTP and DNA fragment 

The DNA fragment is terminated with a dideoxynucleoside 
5 monophosphate (ddNMP) . The ddNMP is attached via a 

linker to a chemical moiety ("X" in Figure 10) . The DNA 
fragment terminated with ddNMP is captured on the surface 
through interaction between chemical moiety "X" and a 
compound on or attached to the surface ("Y" in Figure 
0 10) . The present application discloses two methods for 

freeing the captured DNA fragment terminated with ddNMP. 
In the situation illustrated in the lower part of Figure 
10, the DNA fragment terminated with ddNMP is freed from 
the surface by disrupting or breaking the interaction 
.5 between chemical moiety "X" and compound "Y". In the 

upper part of Figure 10, the DNA fragment terminated with 
* ddNMP is attached to chemical moiety "X" via a cleavable 
linker which can be cleaved to free the DNA fragment 
terminated with ddNMP. 

20 

Different moieties and compounds can be used for the "X" 
- "Y" affinity system, which include but are not limited 
to, biotin-streptavidin, phenylboronic acid- 

salicylhydroxamic acid (31) , and antigen-antibody 
25 systems. 

In different embodiments, the cleavable linker can be 
cleaved and the "X" - "Y" interaction can be disrupted by 
a means selected from the group consisting of one or more 
30 of a physical means, a chemical means, a physical 

chemical means, heat, and light. In one embodiment, 

ultraviolet light can be used to cleave the cleavable 
linker. Chemical means include, but are not limited to, 
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ammonium hydroxide (40), formamide, or a change in pH (- 
log H + concentration) of the solution. 

F. High density streptavidin-coated, porous silica 
5 channel system. 

Streptavidin coated magnetic beads are not ideal for 
using the photocleavable biotin capture and release 
process for DNA fragments, since they are not transparent 

0 to UV light. Therefore, the photocleavage reaction is not 

efficient. For efficient capture of the biotinylated 
fragments, a' high-density surface coated with 
streptavidin is essential. It is known that the 

commercially available 96-well streptavidin coated plates 

.5 " cannot provide a sufficient surface area for efficient 
• capture of "the biotinylated DNA fragments. Disclosed in 
this application is a porous silica channel system 
designed to* overcome this limitation. 



0 To increase the surface area available for solid phase 

capture, porous channels are coated with a high density 
of streptavidin. For example, ninety-six (96) porous 
silica glass channels can be etched into a silica chip 
(Figure 11) . The surfaces of the channels are modified 

5 to contain streptavidin as shown in Figure 12 . The 

•channel is first treated with 0.5 M NaOH, washed with 
water, and then briefly pre-etched with dilute hydrogen 
fluoride. Upon cleaning with water, the capillary channel 
is coated with high density 3-aminopropyltrimethoxysilane 

0 in aqueous ethanol (43) . An excess of disuccinimidyl 

glutarate in N, N-dimethylf ormamide (DMF) is then 
introduced into the capillary to ensure a highly 
efficient conversion of the surface end group to a 
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succinimidyl ester. Streptavidin is then conjugated with 
the succinimidyl ester to form a high-density surface 
using excess streptavidin solution. The resulting 96- 
channel chip is used as a purification cassette. 

A 96-well plate that can be used with biotinylated 
terminators for DNA analysis is shown in Figure 11. In 
the example shown, each end of a channel is connected to 
a single well. However, for other applications, the end 
0 of a channel could be connected to a plurality of wells. 

Pressure is applied to drive the samples through a glass 
capillary into the channels on the chip. Inside the 
channels the biotin is captured by the covalently bound 
streptavidin. After passing through the channel, the 
5 sample enters into a clean plate in the other end of the 

chip. Pressure applied in reverse drives the sample 
through the channel multiple times and ensures a highly 
efficient solid phase capture. Water is similarly added 
to drive out the reaction mixture and thoroughly wash the 
0 captured fragments. After washing, the chip is 

irradiated with ultraviolet light to cleave the 
photosensitive linker and release the DNA fragments. The 
fragment solution is then driven out of the channel and 
into a collection plate. After matrix solution is added, 
the samples are spotted on a chip and allowed to 
crystallize for detection by MALDI-TOF mass spectrometry. 
The purification cassette is cleaned . by chemically 
cleaving the biotin-streptavidin linkage, and is then 
washed and reused. 
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Experimental Set II 

A. Synopsis 

5 The following experiments show the simultaneous 

genotyping of 30 nucleotide variations in the p53 gene 
from human tumors in one tube, by using solid phase 
capturable dideoxynucleotides to generate single base 
extension products which are detected by mass 

10 spectrometry. Both homozygous and heterozygous genotypes 

are accurately determined with digital resolution. This 
is the highest level of SNP multiplexing reported thus 
far using mass spectrometry, indicating the approach will 
have wide applications in screening a repertoire of 

15 genotypes in candidate genes as potential markers for 

cancer and other diseases. 



B. Introduction 



20 With the completion of the Human Genome Project, a stage 

has been set to screen genetic mutations for identifying 
disease genes in a genomewide scale (44). Matrix-assisted 
laser desorption/ionization time-of -flight mass 

spectrometry (MALDI-TOF MS), which allows rapid DNA 

25 sample measurement yielding digital data, has been 

explored to detect single nucleotide polymorphisms (SNPs) 
using invasive cleavage (11) and primer-directed base 
extension (14, 45) . Conventional single base extension 
(SBE) methods using MS to measure multiplex SNPs require 

30 unambiguous simultaneous detection of a library of 

primers and their extension products. However, 
limitations in resolution and sensitivity of MALDI-TOF MS 
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for longer DNA molecules make it difficult to 
simultaneously measure DNA fragments over a large mass 
range. The requirement to measure both primers and their 
extension products in this range limits the scope of 
5 multiplexing. The use of MALDI-TOF MS and molecular 

affinity for multiplex digital SNP detection using solid 
phase capturable (SPC) dideoxynucleotides and SEE has 
recently been explored, establishing the feasibility of 
simultaneously measuring 20 SNPs in synthetic DNA 

10 templates (4 6) . This study shows the simultaneous 

genotyping of 30 nucleotide variations, corresponding to 
known sites of cancer-associated somatic mutations, in 
exons 5, 7 and 8 of the p53 gene from human tumors in one 
tube using the SPC-SBE method. This is the highest level 

15 of multiplexing reported thus far using mass spectrometry 

for SNP analysis. 



C. Materials and Methods 



20 



Multiplex PCR and single base extension reactions 

Multiplex PCR was performed to amplify 3 regions in exons 
' 5, 7 and 8 of the p53 gene. The primers for each region 
were 5 1 -TATCTGTTCACTTGTGCCC-3 ' (exon 5, forward), 5 f - 
CAGAGGCCTGGGGA-CCCTG-3 1 (exon 5, reverse), 5'- 

25 CTGCTTGCCACAGGTCTC-3 ' (exon 7, forward), 5 1 -CACAGCAG- 

GCCAGTGTGC-3 1 (exon 7, reverse), 5 1 -GGACCTGATTTCCTTAC-TG- 
3' (exon 8, forward), and 5 1 -TGAATCTGAGGCATAACTG-3 1 (exon 
8, reverse). The 45 1 PCR reaction consisted of 180 ng 
genomic DNA, 1.5 nmol dNTP, 4.5 1 10X PCR buffer, 15 mM 
MgCl 2 , 4 pmol of forward and reverse primers for exons 5 
and 7, 6 pmol of forward and reverse primers for exon 8, 
and 1.0 U of JumpStart RedAccuTaq DNA Polymerase. After 



30 
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a 5 min 96 °C hot start, the touchdown PCR program was 
performed with 10 cycles of 96 °C (30 sec), 67 *C to 57 °C 
(-1.0 °C per cycle, 30 sec) and 72 °C (30 sec), an 
additional 30 cycles of 96 °C (30 sec), 57 °C (30 sec) and 
72 °C (30 sec), and a final extension at 72 °C for 7 min. 
The 30 SBE primers (Table 3) were designed to yield 
extension products with a sufficient mass difference and 
to be extended simultaneously in a single tube. Primer 
sequences were designed to avoid any overlap in mass, and 
the formation of secondary structures. To evenly 

separate the masses of such a large number of primers for 
SBE, some primers were synthesized using methyl-dC and dU 
phosphoramidites (Glen Research) to replace dC and dT 
respectively. Substitution of dC by methyl-dC increased 
the primer mass by 14 Da whereas a change from dT to dU 
decreased the mass by 14 Da. Primers were synthesized 
using an Applied Biosystems DNA synthesizer. The 
procedures for the SBE, solid phase purification and 
MALDI-TOF MS measurement were performed as described (Kim 
et ai., Analytical Biochemistry 2003, 316, 251). Direct 
DNA sequencing was conducted using energy transfer 
terminator chemistry and a MegaBACE 1000 capillary DNA 
sequencer (Amersham Bioscience) . 



Discussion 



Thirty polymorphic sites, including the most frequently 
mutated p53 codons, were chosen to explore the high 
multiplexing scope of the SPC-SBE method (Figure 1) . 
Thirty primers specific to each polymorphic site were 
designed to yield SBE products with sufficient mass 
differences. This was achieved by tuning the mass of 
some primers using methyl-dC and dU to replace dC and dT, 
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respectively . Human genomic DNA was amplified by 

multiplex PCR to produce amplicons of three p53 exons. 
The 30 primers were mixed with the PCR products and 
biotinylated dideoxynucleotides for SBE to generate 3'- 
5 biotinylated extension DNA products. These products were 

then captured by streptavidin-coated solid phase magnetic 
beads, while the unextended primers and other components 
in the reaction were washed away. The pure DNA products 
were subsequently released from the solid phase and 

10 analyzed by MALDI-TOF MS. The nucleotide at the 

polymorphic site is accurately identified by the mass of 
the DNA extension product in a mass spectrum. Since only 
the DNA extension products are isolated for MS analysis, 
the resulting mass spectrum is free of non-extended 

15 primer peaks and their associated dimers, increasing 

accuracy and scope of multiplexing. The solid phase 
purification also facilitates desalting of the captured 
DNA, a process that is critical for accurate mass 
measurement by MALDI-TOF MS. 

20 

The SPC-SBE genotyping approach was used to analyze 
nucleotide variations in 30 codons of 3 exons of the p53 
gene from 30 Wilms' tumors, 19 head and neck squamous 
carcinomas and 3 colorectal carcinomas. Primer sequences 

25 are shown in Table 3 along with the masses of the primers 

and their extension products. Extension products of all 
30 primers were resolved in the mass spectrum, free from 
any unextended primers, yielding digital data to 
unambiguously determine each nucleotide variation 

30 (Figures 13A-13C) . Unextended primers occupy the mass 

range in the mass spectrum decreasing the scope of 
multiplexing, and excess primers can dimerize to form 
false peaks in the mass spectrum (21) . The excess primers 
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and their associated dimers also compete for the ion 
current, reducing the detection sensitivity of MS for the 
desired DNA fragments. These complications were 

completely removed in the SPC-SBE method. When using 
5 conventional ddNTPs, the mass difference between ddATP 

and ddTTP is 9 Da, which is difficult to resolve by 
MALDI-TOF MS (15) . In the SPC-SBE method using 

biotinylated ddNTPs, the difference between A and T is 
increased to 66 Da, which fosters accurate detection of 
10 heterozygous genotypes. 



None of the 30 Wilms' tumor samples showed somatic 
mutations for the 30 polymorphic sites tested, yielding 
30 distinct peaks corresponding to the wild type p53 

15 sequences in a mass spectrum (Figure 13A) . In contrast, 

two of the 19 head and neck tumor samples contained a 
genetic variation; one at codon 157 (G/T heterozygous 
configuration; primary tumor biopsy; Figure 13B) and the 
other at codon 151 (C to T homozygous; squamous carcinoma 

20 cell line; Figure 14). In the three colorectal tumor 

cell lines tested, one (HCT-116) had 30 wild type p53 
sequences for the 30 sites, yielding a mass spectrum 
similar to the one shown in Figure 13A, while the other 
two (HT-29 and SW-480) had a G to A homozygous mutation 

25 in codon 273 (Figure 13C) . Both heterozygous and 

homozygous genotypes were clearly detected in the 30 
codons with great accuracy. The G/T heterozygote 

(4684/4734 Da) was shown with two peaks corresponding to 
the wild type and mutant alleles, respectively (Figure 

30 13B) . These data, confirmed by direct DNA sequencing, 

are consistent with the known paucity of the p53 
mutations in Wilms' tumor, and the known occurrence of 
such mutations in squamous carcinomas and colorectal 



WO 2004/007773 



-54- 



PCT/US2003/021818 



carcinomas . 

It has been reported that MALDI-TOF MS could detect DNA 
sequencing fragments up to 100 bp with sufficient 
resolution using cleavable primers (29).. The mass 
5 difference between each adjacent DNA sequencing fragment 

is approximately 300 Da.' In principle, with a mass 
difference of 100 Da for each primer in designing a 
multiplex SNP analysis project using the SPC-SBE method, 
at least 300 SNPs can be analyzed in a single spot of an 

10 MS sample plate. Thus, each MS sample plate with 384 

spots can produce over 100,000 SNPs, which is roughly the 
number of tag . SNPs required to identify all the 
haplotypes in the human genome. This level of 

multiplexing should be achievable by mass tuning the 

15 primers with nucleotide analogues containing stable 

chemical groups (28) . It is anticipated that the SPC-SBE 
high-throughput digital SNP detection approach will have 
wide applications in screening a repertoire of genotypes 
in candidate genes as potential markers for cancer and 

20 other diseases. 
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Table 3. Thirty p53 codons and the corresponding 30 SBE primers. 
The position of the nucleotide variation tested in each codon is 
shown in bold. The primer sequence and modification is specified 
and the modified nucleotides are shown in bold. The mass of each 
primer is indicated along with the mass of all four possible SBE 
products. The mass values in bold specify the wild type nucleotide 
sequences (ddNTP-B = Biotinylated dideoxynucleotides ) . 



Primer 
Number 


Exon 


Codon 


1 


5 


"179 (CAT) 


2 


5 


157 (GTC) 


3 


5 


179 (CAT) 


4 


5 


163(1 AC) 


5 


5 


158 (CGC) 


6 


7 


248 (CG(i) 


7 


5 


132 (AAG) 


8 


8 


298 (GAG) 


9 


8 


285 (GAG) 


10 


5 


161 (GCC) 


11 


7 


249 (AGG) 


12 


8 


266 (GGA) 


13 


8 


286 (GAA) 


14 


7 


258 (GAA) 


15 


5 


176 (IGC) 


16 


5 


152 (CCG) 


17 


8 


273 (CG I ) 


18 


7 


234 (TAC) 


19 


7 


248 (CGG) 


20 


7 


249 (AGG) 


21 


8 


282 (CGG) 


22 


8 


278 (CCT) 


23 


5 


135 (IGC) 


24 


7 


245 (GGC) 


25 


7 


237 (AIG) 


26 


7 


242 (TGC) 


27 


7 


241 (TCC) 


28 


8 


275 (TOT) 


29 


5 


141 (TCC) 


30 


5 


175 (CGC) 



Sequences (5'-3') 



Modification Primer 
Mass (Da) 



Mass of Single Base Extention Products (D 
ddGTP-B ddUTP-B 



GCGCTGCCCCCAC " None 3857 

GCCC GGC ACCC GC methyl C 3980 

GCGCTGCCCCCACC None 4146 

CGCCATGGCCATCT methyl C 4270 

CCGGCACCCGCGTCC None 4475 

TGGGCGGCATGAACC None 4618 

TCCCCTGCCCTCAACA methyl C 4736 

AGGGGAGCCTCACCAC None 4876 

GAGAGA CC GGC GC AC A methyl C 4995 

CCCGCGTCCGCGCCATG None 5108 

GGCGGCATGAACCGGAG methyl C 5341 

GTAGTGGTAATCTACTGG dU 5486 

AGAGACC GGCGCACAGAG methyl C 5638 

C CTC ACC ATC ATC AC ACTG methyl C 5765 

ACGGAGGT TGTGAGGCGCT dU 5897 

GTGGGTTGATTCCACACCCC dU 6041 

ACGGAACAGCTTTGAGGTGC . None 6182 

CTGACTGTACCACCATCCACT None 6286 

TCCTGCATGGGCGGCATGAAC dU 6405 

GCATGGGCGGCATGAACCGGA None 6521 

TTGTGCC TGTCC TGGGAGAGAC dU 6698 

TGAGGTGCGTGTTTGTGCCTGT None 6819 

CCCTGCCCTCAACAAGATGTTTT None 6935 

TGTG TAACAGTTCCTGCATGGGC dU 7043 

TACCACCATCCACTACAACTACAT None 7170 

ACAAC TACATGTGTAACAGTTCCT dU 7282 

ACTAC AAC TAC ATGTGTAAC AGTT methyl C 7390 

GGAACAGCTTTGAGGTGCGTGTTT methyl C 7497 

ATGTTTTGCCAACTGGCCAAGACCT None 7617 

CAGCACATGACGGAGGTTGTGAGGC None 7772 



ddATP-B ddCTP-B 

4545 4522 

4668 4645 

4834 4811 

4958 4935 

5163 5140 

5306 5283 

5424 5401 

5564 5541 

5683 5660 

5796 5773 

6029 6006 

6174 6151 

6326 6303 

6453 6430 

6585 6562 

6729 6706 

6870 6847 

6974 6951 

7093 707Q 

7209 7186 

7386 7363 

7507 7484 

7623 7600 

7731 7708 

7858 7835 

7970 7947 

8078 8055 

8185 8162 

8305 8282 

8460 8437 



4561 

4684 

4850 

4974 

5179 

5322 

5440 

5S80 

5699 

5812 

6045 

6190 

6342 

6469 

6601 

6745 

6886 

6990 

7109 

7225 

7402 

7523 

7639 

7747 

7874 

7986 

8094 

8201 

8321 

8476 



4611 

4734 

4900 

5024 

5229 

5372 

5490 

5630 

5749 

5862 

6095 

6240 

6392 

6519 

6651 

6795 

6936 

7040 

7159 

7275 

7452 

7573 

7689 

7797 

7924 

8036 

8144 

8251 

8371 

8526 
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